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ORIGINAL ARTICLE 

Transcriptional targets of the schizophrenia risk gene MIR137 

AL Collins 1 , Y Kim 1 , RJ Bloom 1 , SN Kelada 1 , P Sethupathy 1 and PF Sullivan 1 ' 2 

Genome-wide association studies (GWAS) have strongly implicated MIR137 (the gene encoding the microRNA miR-137) in 
schizophrenia. A parsimonious hypothesis is that a pathway regulated by miR-137 is important in the etiology of schizophrenia. Full 
evaluation of this hypothesis requires more definitive knowledge about biological targets of miR-137, which is currently lacking. Our 
goals were to expand knowledge of the biology of miR-137 by identifying its empirical targets, and to test whether the resulting 
lists of direct and indirect targets were enriched for genes and pathways involved in risk for schizophrenia. We overexpressed 
miR-137 in a human neural stem cell line and analyzed gene expression changes at 24 and 48 h using RNA sequencing. Following 
miR-137 overexpression, 202 and 428 genes were differentially expressed after 24 and 48 h. Genes differentially expressed at 24 h 
were enriched for transcription factors and cell cycle genes, and differential expression at 48 h affected a wider variety of pathways. 
Pathways implicated in schizophrenia were upregulated in the 48 h findings (major histocompatibility complex, synapses, FMRP 
interacting RNAs and calcium channels). Critically, differentially expressed genes at 48 h were enriched for smaller association P- 
values in the largest published schizophrenia GWAS. This work provides empirical support for a role of miR-137 in the etiology of 
schizophrenia. 
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INTRODUCTION 

Schizophrenia is now known to be highly polygenic with a genetic 
architecture characterized by contributions from dozens and 
probably hundreds of different genetic loci. Although rare 
deleterious exonic and copy number variation (CNVs) have roles, 
numerous common variants of more subtle effect account for 
sizable fractions of the heritability of schizophrenia. 1-4 

A replicated and robust association with schizophrenia is in 
MIR137, the gene encoding the microRNA miR-137. This region 
was initially identified by the Psychiatric Genomics Consortium 
(P= 1.6x 10 -11 ). 5 In the largest genome-wide association study 
(GWAS) yet conducted (by the Psychiatric Genomics Consortium, 
36 989 cases), the second strongest finding was within MIR137 
(P = 3.4x 10 -19 ). 1 Pathway analyses have suggested that genes 
containing predicted miR-137 target sites are significantly 
enriched for smaller P-values for association with 
schizophrenia. 5-7 This enrichment is not robustly identified in all 
studies, 1 and one possibility is that miR-137 target predictions are 
not sufficiently accurate. 

It is possible that a pathway regulated by miR-137 is important 
in the etiology of schizophrenia. microRNAs can target large 
numbers of mRNAs for degradation or translational inhibition, and 
a single microRNA can have major effects on cellular pathways. 8-11 
miR-137 has important roles in neural development 12-15 and has 
been implicated in autism, intellectual disability, Rett syndrome, 
Alzheimer's disease, Huntington's disease and multiple 
cancers. 14 ' 16-28 

Schizophrenia GWAS results, pathway analyses and the roles of 
miR-137 in development and disease are consistent with the 
broad hypothesis that genetic variation in miR-137 and its targets 
constitute a pathway whose dysfunction is involved in the 
etiology of schizophrenia. There are several major limitations in 
evaluating this hypothesis. A key issue is that miR-137 target 



genes are mostly based on computational algorithms of imperfect 
sensitivity and specificity. These algorithms predict only the direct 
targets of miR-137. 29,30 Some direct targets of miR-137 have been 
verified 12-1 4,1 8,25-28,31 32 including genes implicated in schizophre- 
nia (CACNA1C, TCF4, CSMD1, C10orf26 and ZNF804A). 33,34 However, 
many of these targets were identified in cancers, and their 
neuronal role is unclear. miRNA regulatory activity is complex and 
can include regulatory loops, post-translational modifications and 
non-canonical target sequences (for example, in coding regions 
and not 3' untranslated regions). 15,35,36 The 'downstream' effects 
of miR-137 (that is, its impact beyond a directly targeted 
transcript) have not been systematically evaluated. 

We thus evaluated the effects of miR-137 overexpression in a 
human neuronal cell line. Our goal was to expand knowledge of 
the biology of miR-137 by identifying its empirical targets. We 
transduced neuronal stem cells with vectors expressing miR-137 
or a control miRNA mimic and used high-throughput sequencing 
(RNA-seq) at 24 and 48 h after transduction to assess the effects of 
miR-137 overexpression. Overexpression of miR-137 had an 
impact on multiple cell cycle genes, consistent with other findings 
of effects on proliferation and downregulation in cancer. However, 
we also found that the indirect effects of miR-137 overexpression 
at 48 h had strong connections with GWAS findings for schizo- 
phrenia. These results provide functional support for a role of 
miR-137 in the etiology of schizophrenia. 



MATERIALS AND METHODS 

Cell line 

ReNcell-VM (Millipore, Billerica, MA, USA) is a human neural precursor cell 
line derived from the ventral mesencephalon. 37 We chose this cell line 
because it was readily available, accessible to our methods and not derived 
from a cancer. Moreover, we genotyped two different clones with lllumina 
OminExpress arrays and confirmed the expected male sex, European 
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ancestry, the absence of aneuploidy and the absence of CNV larger than 
> 100 kb that affected a gene. 

Infection of cells 

ReNcell-VMs were grown on laminin-coated plates in ReNcell NSC 
Maintenance Media (Millipore) with 20ngmr 1 FGF-2 and 20ngmr 1 
epidermal growth factor. We used pEZX-MR03 lentiviral vectors with 
miR-137 or a control packaged with an HIV-1 -compatible system 
(GeneCopoeia, Rockville, MD, USA). Infection efficiency was assessed by 
microscopic evaluation of eGFP expression (contained in the pEZX-MR03 
plasmid). To evaluate nonspecific effects because of overloading of the 
miRNA-processing machinery, we used a control lentiviral vector contain- 
ing a miRNA mimic (that is, a random miR-like sequence of no similarity to 
any known miRNA). Flasks of cells were infected in parallel with the 
miR-137 or control lentivirus. Viral amounts were titrated to achieve 
-100% infection rates without evidence of toxicity. For the 48-h time 
point, the cells were exposed to virus for 24 h and then replaced with fresh 
media. Cells were collected 24 or 48 h after infection, stored in RNA- 
stabilizing media (RNA/afer, Qiagen, Venlo, The Netherlands) and frozen at 
-80°C until analysis. Evaluation of two relatively early time points was 
consistent with our goal of identifying the immediate effects of miR-137 
overexpression as these are likely to be enriched for genes targeted by 
miR-137. 

RNA extraction 

Total RNA was extracted using either Maxwell 16 LEV Simply RNA kit 
(Madison, Wl, USA; 24 h RNA-seq) or Norgen Biotek RNA/DNA/Protein Kit 
(Thorold, ON, Canada; 48 h RNA-seq and mature miR-1 37 assays). Although 
both methods yield total RNA, the Norgen kit extracts more mature 
microRNAs (data not shown) and is better suited for quantification of 
miR-137. 

Quantitative PCR 

We used TaqMan assays to measure expression of mature miR-137 
(Applied Biosystems, assay ID 001 129; Grand Island, NY, USA). Each sample 
was run in quadruplicate, and StepOne software (v2.1; SAS, Cary, NC, USA) 
was used to evaluate AC t and fold change (RQ) values. The endogenous 
control was RNU6B. Statistical significance of the AC t values was assessed 
using nested analysis of variance (SAS v9.2, PROC GLM). All assays were 
performed on the same RNA samples as were used for RNA-seq except for 
the miR-137 assays at the 24-h time point, which were carried out on 
replicate samples (using RNA extracted with the Norgen kit). 

RNA-seq 

RNA-sequencing libraries were generated using Truseq RNA sample prep 
kits (lllumina, San Diego, CA, USA), which generate libraries from poly- 
adenylated RNAs. Each library was barcoded, pooled and sequenced using 
lllumina HiSeq 2000 and single-end 100-bp reads. Single-end sequencing 
was chosen to maximize read depth and detection of small fold changes. 
There were three miR-137 overexpressions and three controls at 24 h, and 
five miR-137 overexpressions and six controls at 48 h. Read depth 
averaged -55 million reads per replicate at 24 h and -160 million reads 
per replicate at 48 h (see below for explanation of greater numbers of 
reads at 48 h). All procedures were randomized. 

Data processing and analysis 

We used the TopHat-Cufflinks-Cuffdiff pipeline to detect differential gene 
expression between neural stem cells with increased expression of 
miR-137 compared with controls. First, we used FastQC (see URLs) to 
evaluate sequencing quality. As one 48-h sample failed, we repeated RNA- 
seq on all of the 48-h samples. We verified sample identity across all 
experiments via the creation of a vector of genotypes based on high- 
confidence heterozygous exonic RNA-seq reads. RNA-seq reads were also 
aligned to the overexpression sequence (miR-137 or a miR-like mimic) to 
confirm experimental condition. Second, we mapped RNA-seq reads to the 
human reference genome (UCSC hg19) using TopHat2 and transcript 
annotations from the TopHat website (see URLs). 38 Third, the mapped 
reads were assembled into transcripts using Cufflinks (v2.0.1; http:// 
cufflinks.cbcb.umd.edu). 39 Fourth, we pooled the Cufflinks output files 
from multiple RNA-seq samples using Cuffcompare to assemble a 
comprehensive set of transcripts. Finally, we used Cuffdiff 39 to evaluate 



differential gene expression of miR-137 overexpression versus the control 
vector separately at 24 and 48 h with correction for multiple comparisons 
using a false-discovery rate approach (g-value). 40 We used the RNA-seq 
data to evaluate gene expression of transcripts whose protein products are 
known to characterize neuronal differentiation using a read count 
approach (Supplementary Methods). A few genes were reported more 
than once, and the transcript with the smallest P-value was retained. 41 

Additional analyses 

We compared the effects of miR-137 overexpression in neural progenitor 
cells to computational predictions of genes targeted by miR-137. We used 
Sylarray (see URLs) to test all known miRNA seed sequences in the RNA-seq 
gene level data. 42 The analyses are performed at varying cutoffs to create a 
curve of log qo (P) values for each miRNA. We also compared differentially 
expressed genes from miR-137 overexpression to computationally 
predicted miR-137 targets (TargetScan v6.2; http://www.targetscan.org) 43 
with significance determined using permutation. 

DAPPLE was used to evaluate whether the lists of genes had more 
protein-protein interactions than predicted by chance. 44 We used 
DAVID 45,46 to evaluate whether genes differentially expressed upon 
miR-137 overexpression tended to occur in specific biological pathways 
or processes including GO, 47 KEGG 48 and Biocarta (see URLs). DAVID 
generated enrichment scores for correlated pathways and Benjamini 
corrected P-values for individual pathways. PWM scan 49,50 was used to 
predict high-confidence transcription factor-binding sites in gene promo- 
ters. We then identified transcription factor-binding site over-represented 
in the promoters of significantly differentially expressed genes by Monte 
Carlo simulation analysis. 

We compared genes differentially expressed following miR-137 over- 
expression to an analogous experiment by Hill et ol. 5 ^ and to a gene 
expression study in lymphoblastoid cell lines from schizophrenia cases and 
controls. 52 We evaluated the relevance of genes differentially expressed 
following miR-137 overexpression to pathways previously implicated in the 
etiology of schizophrenia: expert-curated lists of genes found at synapses 
(1043 genes), 53 genes encoding proteins found at the postsynaptic density 
(1457 genes), 54 genes in neuronal activity-regulated cytoskeleton-asso- 
ciated protein postsynaptic signaling complexes (28 genes), 55 N-methyl-D- 
aspartate receptors (61 genes), 55 L-voltage-gated calcium channel subunits 
(26 genes), 6 genes whose proteins interact with calcium channel (Cav2, 
207 genes) 56 and genes whose mRNAs interact with FMRP (fragile X mental 
retardation protein, 842 genes) 57 

Finally, we assessed the overlap of genes differentially expressed 
following miR-137 overexpression to schizophrenia GWAS results. This 
analysis is a relatively direct evaluation of the salience of sets of genes 
directly or indirectly regulated by miR-137 to genetic variation implicated 
in the etiology of schizophrenia. We used INRICH 58 to test whether 
differentially expressed genes were enriched for smaller GWAS P-values in 
the largest published schizophrenia GWAS 6 (Supplementary Methods). 



RESULTS 

We transduced human neural precursor cells (ReNcell-VM) with 
lentiviral vectors expressing either miR-137 or a scrambled miRNA- 
like control. We verified infection rates >99% via fluorescent 
imaging of the eGFP tag in the vector. Alignment of RNA-seq 
reads to the lentiviral sequence confirmed correct expression of 
the experimental vector in each sample. 

Verification 

We verified increased miR-137 expression in two ways. qPCR 
confirmed marked increases in the amounts of mature miR-137 at 
24 h (1 1 1 x increase) and 48 h (548x increase) in cells exposed to 
lentivirus expressing miR-137 compared with controls 
(Supplementary Figure S1A). We used RNA-seq to estimate 
expression of the primary poly-adenylated MIR137 transcript from 
the miR-1 37 lentiviral vector and the MIR137 locus. Consistent with 
the qPCR results, we found 123x increase in the primary transcript 
at 24 h and 660x increase at 48 h (Supplementary Figure S1B). 

As miR-137 can effect neuronal proliferation and 
differentiation, 1 2-14,59 we evaluated cell density, morphology and 
expression of differentiation markers in the same flasks used for 
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RNA-seq. There were no significant differences in cell densities 
between miR-137 overexpression and control flasks at 24 (P = 0.53) 
or 48 h (P = 0.78), although the 48-h flasks had higher cell densities 
than at 24 h (P = 7.6x10 -9 , as expected because of additional 
growth time). We noted no qualitative differences in cellular 
morphology between the experimental conditions at either time 
point. We used RNA-seq to compare expression of 12 genes 
whose protein products are commonly used to classify neuronal 
cell types (Supplementary Table S1). There were no significant 
differences between the miR-137 overexpression and control 
conditions. The overall pattern of results is consistent with that of 
a neuronal stem cell. Thus, large increases in expression of 
miR-137 did not lead to clear alterations in cell fate after 24 or 
48 h. 

We verified the downregulation of predicted targets of miR-137 
in two ways. First, we applied Sylarray 42 to the RNA-seq data to 
evaluate fold change differences associated with all known miRNA 
seed sequences. Genes with the miR-137 seed sequence 
(AGCAATTA) in the 3'-untranslated repeat evidenced the greatest 
downregulation of any seed sequence (Supplementary Figure S1C, 
minimum P=1.1x10~ 22 in the 24-h condition and minimum 
P = 2.4x10 -45 in the 48-h condition). As a complementary test, 
we used permutation to evaluate differential gene expression in 
1115 genes with predicted miR-137 target sites (TargetScan 
v6.2). 43 Genes with a predicted miR-137 target site were 
significantly more likely to be differentially expressed 
(Supplementary Figure SID, P<10~ 200 at both 24 and 48 h), 
and most were downregulated at 24 (82%) and 48 h (87%). These 
results are consistent with profound and relatively specific 
downregulation resulting from miR-137 overexpression. 
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General expression patterns 

Supplementary Table S2 shows patterns of RNA-seq analyzability 
at 24 and 48 h. After removing transcripts that had insufficient 
sequencing depth, low expression levels or technical faults and 
resolving genes reported more than once, there were 14 682 
transcripts analyzable at both 24 and 48 h. The correlations 
between miR-137 overexpression and control conditions were 
0.999 at both 24 and 48 h, and the correlations between the 24 
and 48 h conditions were 0.980-0.982 (Supplementary Figures S2a 
and S2b). We applied principal component analysis to the 
expression data matrix (Supplementary Figure S3). The first 
principal component accounted for 97% of the variance and 
completely distinguished the replicates from 24 and 48 h. These 
descriptive results suggest the comparability of the RNA-seq data 
within and between time points. The results also suggest that only 
a fraction of transcripts was altered by the miR-137 experimental 
manipulation. 

Differential gene expression 

Supplementary Figure S4 shows descriptive statistics for log 2 fold 
changes, P-values and g-values for the 24 and 48 h analyses. These 
analyses compare the miR-137 overexpression to a control vector 
within each time point. At 24 h, 14 869 unique genes were 
analyzable and 202 (1.36%) had evidence of differential gene 
expression (q < 0.05, Supplementary Table S3). Of the differentially 
expressed genes, 130 genes (64.4%) had evidence of inhibition of 
gene expression following exposure to markedly elevated 
amounts of miR-137 and 72 genes (35.6%) showed enhancement. 
At 48 h, 428 (2.71%) of 15 780 analyzable and unique genes had 
evidence of differential expression (q < 0.05, Supplementary Table 
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Figure 1. Parallel plots of fold change values for the 1 4 694 analyzable genes at both 24 and 48 h. The four graphs are for each combination of 
significance at 24 and 48 h time points {q < 0.05). In each graph, the x axis shows fold changes (FCs) at 24 and 48 h time points, log 2 (FC 24 ) and 
log 2 (FC 48 ). The y axis is log 2 FC with positive values indicating upregulation upon overexpression of miR-137 and negative values the 
opposite. Each line connects the log 2 FC values for a gene. Large FC values may not be statistically significant (can occur when expression 
levels are low). All lines in a are colored gray. For b-d, red lines indicate inhibition at both 24 and 48 h, green lines increased expression at 
both 24 and 48 h and grey lines show genes whose log 2 FC values switched signs between 24 and 48 h. 
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Table 1. DAVID pathway analysis of differentially expressed genes {q < 0.1) 
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Abbreviations: EGF, epidermal growth factor; MHC, major histocompatibility complex. Pathway analysis from DAVID. As many annotated pathways contain 


overlapping genes, DAVID combines such pathways into clusters. Column 4 shows our labels for these clusters. Individual pathways are reported in 
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S4). Of these, 157 genes (36.7%) showed inhibition after miR-137 
overexpression and 271 genes (63.3%) showed enhancement. 

Review of the read count data showed no instances where 
miR-137 overexpression acted as a switch at either time point 
(that is, high expression in the control condition and very low 
expression with miR-137 overexpression). Differentially expressed 
genes had subtler effects with transcriptional changes ranging 
from 1.8-fold downregulation to 1.8-fold upregulation at 24 h and 
from 2.9-fold downregulation to 2.8-fold upregulation at 48 h 
(only 12 differentially expressed transcripts had fold changes in 
excess of 2). Thus, despite substantial increases in miR-137 
concentrations, the effects were quantitative not absolute and 
consistent with a role for miR-137 acting to fine-tune gene 
expression. 

Figure 1 depicts full results from this study and lists of 
differentially expressed genes are in Supplementary Tables S3 
and S4. The results indicate the complexity of the effects of 
miR-137 overexpression with time. For example, of transcripts that 
were differentially expressed at both 24 and 48 h, only 44.1% 
(15/34) had the same direction of effect each time. 

We asserted above that the widely used, computationally 
derived TargetScan (v6.2) list of miR-137 targets is imperfect. The 
presence of a predicted TargetScan-binding site was associated 
with significant (q < 0.05) differential gene expression at 48 h 
(odds ratio 4.0, 95% confidence interval (CI) 3.2-5.1, 
P = 2.7x10 -33 ) and, as expected from the analyses above, this 
was mainly because of inhibitory effects of miR-137 overexpres- 
sion (odds ratio = 10.6, 95% CI 7.6-14.6, P = 6.4x10" 69 ) rather 
than enhancement (odds ration 1.2, 95% CI 0.7-2.27, P = 0.52). 
However, these strong statistical associations were not robustly 
predictive as only 8.9% (87/1002) of analyzable genes with 
predicted miR-137 target sites were differentially expressed and 
only 20.3% (87/428) of differentially expressed transcripts had 
predicted miR-137 target sites. The direct and indirect effects of 



miR-137 thus appear to be complex and not robustly captured 
using TargetScan. 

Pathway analyses 

As the effects of miR-137 overexpression influenced multiple 
genes and never acted as a switch, we attempted to identify 
biological themes that could unify the effects of miR-137 across 
many different transcripts. We used a more liberal definition of 
differential expression in these analyses {q < 0.1 instead of 0.05). 

We used DAPPLE 44 to assess whether the protein products of 
genes with significant differential expression had greater protein- 
-protein interaction than expected by chance. We found 
significantly more interactions at 24 h (P = 0.01) and at 48 h 
(P = 0.001), and many of these genes formed a single network at 
each time point (Supplementary Figure S5). This result is 
consistent with the hypothesis that miR-137 influences multi- 
component functional networks rather than targeting of unrelated 
individual genes. 

We applied pathway analysis to lists of differentially expressed 
genes using DAVID (Table 1) 45,46 At 24 h, genes with inhibition 
after exposure to markedly increased amounts of miR-137 
(presumably enriched for direct targets of miR-137) were enriched 
for pathways related to extracellular matrix, positive regulation of 
transcription, neuronal differentiation/projection and extracellular 
signaling (Supplementary Table S5a). Genes with increased 
expression at 24 h (presumptive indirect targets of miR-137) were 
enriched for cell cycle consistent with prior reports 
(Supplementary Table S5D). 1 5,27,59 In the 48-h data, pathway 
analysis of differentially expressed genes yielded more changes. 
Interestingly, cell cycle pathways (upregulated at 24 h) were 
strongly downregulated at 48 h (Supplementary Table S6a). 
Pathways involved in DNA replication and cell cycle processes 
were also downregulated at 48 h. Upregulated pathways at 48 h 
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Table 2. Analyses of pathways implicated by schizophrenia genome-wide studies 
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Abbreviations: ARC, neuronal activity-regulated cytoskeleton-associated protein postsynaptic signaling complexes; Cav2, voltage-gated calcium channel; 
CI, confidence interval; DE, differentially expressed; FMRP, fragile X mental retardation protein; NMDA, N-methyl-D-aspartate; OR, odds ratio. Boldface shows 
P<0.01. 



included strong enrichment for genes involved in multiple types 
of cell-cell interactions, the major histocompatibility complex 
class II antigens (a region containing the most significant GWAS 
association for schizophrenia), 5,6,60,61 synapses and ion channels/ 
calcium binding (Supplementary Table S6b). 

Positive regulators of transcription were the second most 
enriched cluster from the pathway analyses (Table 1), and four of 
the most significantly enriched individual pathways from DAVID 
involved transcriptional regulation (Supplementary Table S5a). 
One of the top associations arising from schizophrenia GWAS is 
TCF4, a transcription factor that contains two known miR-137- 
binding sites. 33 These results suggest that the effects of miR-137 
can be extended markedly by altering the regulation of 
transcription factors. To explore this possibility, we obtained a 
list of 1745 genes encoding transcription factors from Transfac. 62 
At 24 h, 29 different transcription factors were downregulated 
(<7<0.1), slightly more than chance expectations (P = 0.01), 
whereas there was no over-representation of transcription factors 
in the upregulated genes (P = 0.61). At 48 h, genes encoding 
transcription factors were less likely to be upregulated data 
(P = 8.2x 10~ 4 ). These results are consistent with miR-137 acting 
as an amplifier via directly fine-tuning the regulation of 
transcription factors, which then leads to secondary regulatory 
effects. TCF4 is a known direct target of miR-137 33 and has been 
implicated in schizophrenia GWAS. 5,6 Although TCF4 was not itself 
differentially expressed, we found enrichment for TCF4-bindings 
sites in the promoters of genes with significant differential 
expression at 48 h (P = 0.003), including both up- (P = 0.007) and 
downregulation (P = 0.01). These results highlight the evident 
complexity of miR-137 gene regulation and that effects on true 
direct targets may not be detectable (due to transcriptional 
repression or subtle effects) and yet influence transcription for 
multiple genes. 

Genomic studies of schizophrenia have implicated biological 
pathways using multiple types of genomic data (common 
variation, rare CNVs and rare exonic variation). 3,6,55,63 It is possible 
that genes influenced by miR-137 alter risk by directly or indirectly 



targeting pathways central to pathogenesis. We thus evaluated 
whether pathways implicated in schizophrenia were enriched for 
transcripts with differential expression consequent to exposure to 
increased amounts of miR-137. First, pathways including sets of 
synaptic genes have been implicated in schizophrenia. Expert- 
curated lists of synaptic genes are enriched for smaller GWAS 
P-values, 6,63 and genes encoding postsynaptic density proteins, 
N-methyl-D-aspartate receptors and activity-regulated cytoskele- 
ton-associated protein complexes (activity-regulated 
cytoskeleton-associated protein) have been implicated in schizo- 
phrenia by studies of de novo CNV, de novo deleterious exome 
variation and deleterious exome variation in cases and 
controls. 3,4,55 We tested for associations of these pathways with 
genes with differential expression at 24 and 48 h (Table 2). None of 
these pathways were associated in the 24-h data but three 
showed significant associations in the 48-h data (all but N-methyl- 
D-aspartate receptor genes). The associations for synaptic genes 
were particularly strong. Intriguingly, the 48-h associations tended 
to be in differentially expressed genes showing upregulation. 
Second, genes encoding RNAs that interact with FMRP (the 
protein product of FMR1) 57 are enriched for smaller GWAS 
P-values and deleterious exon mutations in schizophrenia. 3 We 
found significant association between genes whose RNAs interact 
with FMRP with upregulated genes at 48 h, supporting the idea 
that RNAs bound by FMRP are influenced by miR-137. Third, 
neuronal calcium channel signaling has been implicated in 
schizophrenia and bipolar disorder 6,6,64 We found significant 
associations between calcium channel subunits with genes 
upregulated at 48 h following exposure to markedly increased 
amounts of miR-137. The analogous association with genes whose 
protein products were identified as interacting with Ca v 2 (voltage- 
gated calcium channel) was nominally significant. 56 Overall, there 
was little evidence of association between genes differentially 
regulated by miR-137 at 24 h but relatively consistent associations 
with upregulation at 48 h. It is plausible that indirect/upregulation 
effects require longer exposure to increased levels of miR-137; this 
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Table 3. Enrichment of schizophrenia GWAS results and genes with 
differential expression after overexpression of miR-137 



Time DE genes (q<0.1) Enrichment P-value 



24 h All >0.1 

Downregulation >0.1 

Upregulation >0.1 

48 h All 0.014 

Downregulation 0.005 
Upregulation 0.01 1 



Abbreviation: GWAS, genome-wide association study. P-values from INRICH 
evaluating the enrichment of lower GWAS P-values (P<0.01) 6 to 
differentially expressed genes (<7<0.1) from miR-137 overexpression. The 
extended major histocompatibility complex region (chr6:25-34mb) was 
excluded due to its high gene density, extended linkage disequilibrium 
and strong GWAS signal in schizophrenia. Boldface shows P<0.01. 



supports a role for miR-137 as an effector that indirectly regulates 
pathways critical to schizophrenia. 

We compared these results with those of a similar experiment 
by Hill et al. 5 ^ (~1 000-fold increase in miR-137 expression in the 
neural cell line CTXOE03, gene expression measured at 72 h using 
lllumina HT-12v4 arrays). For the genes present in both studies, 
there was a significant negative correlation in the log 2 fold change 
values at 24 h (Spearman p = -0.33, P = 5x10~ 19 ) and a 
significant positive correlation at 48 h (Spearman p = 0.25, 
P = 3x10 -11 ). Despite the use of a different vector, cell line and 
expression technology, these results are surprisingly comparable 
to these in our experiment. However, there was no significant 
overlap between our results and genes that were differentially 
expressing in lymphoblastoid cell lines from schizophrenia cases 
and controls (data not shown). 52 

Schizophrenia GWAS results 

Previous work has shown significant enrichment of smaller 
schizophrenia GWAS P-values for predicted targets of miR-137. 5,6 
However, these analyses were based on TargetScan-predicted 
direct effects of miR-137 which, as described above, have 
incomplete overlap with the biological direct and indirect effects 
of miR-137 overexpression in human neural precursor cells. We 
thus compared the empirical findings from this report with results 
from the largest published schizophrenia GWAS (Table 3)7 There 
was no significant enrichment for genes differentially expressed at 
24 h. However, there was significant enrichment of smaller GWAS 
P-values for downregulated targets of miR-137. 

DISCUSSION 

miR-137 is a particularly intriguing association for schizophrenia, 
given the strength of its association and the co-occurrence of 
additional associations in many of its predicted targets. These 
empirical findings from unbiased genomic screens are consistent 
with the hypothesis that a biological or developmental process 
orchestrated by miR-137 contributes to the etiology of schizo- 
phrenia. As we demonstrated, a key limitation of our current 
knowledge is that we have only an approximate idea of the genes 
influenced by miR-137 (computational prediction of direct targets 
using TargetScan and almost nothing on its indirect targets). 

As an initial approach to understanding the reach of miR-137, 
we overexpressed miR-137 in human neuronal stem cells to detect 
genes whose transcripts showed differential expression upon 
exposure to large increases in amounts of miR-137. Notably, our 
results had approximate overlap with a similar study 51 Using a 
number of verification analyses, we demonstrated that we 
successfully increased the amounts of miR-137 and that these 
effects were relatively specific. Our approach allowed detection of 
direct targets influenced by RNA degradation plus secondary 
effects in the form of increased expression. Via identification of 
individual genes and larger-scale alterations in biological path- 
ways, our findings provide a window into the cellular mechanisms 
regulated by miR-137 in neuronal stem cells. Moreover, the 
different patterns of findings at 24 and 48 h highlight the dynamic 
regulation of miR-137, and emphasize that findings at a single 
time point may not encompass all targets of a given miRNA. In 
aggregate, these admittedly circumstantial results increase sup- 
port for a role of miR-137 in schizophrenia. 

First, the direct and indirect targets of miR-137 that we 
identified had relatively small changes in expression, typically 
less than twofold. We identified no instances where miR-137 acted 
as an on- or off-switch. These findings are consistent with prior 
studies showing that microRNAs yield subtle changes in many 
genes. 10,11 Moreover, many differentially expressed genes were 
not predicted targets of miR-137, and many predicted targets are 
not altered. We do not believe that this was be due to off target 
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effects or secondary effects on different microRNAs, given that 
Sylarray detected only enrichment in miR-137 seed sequences 
(Supplementary Figure S1C). 

Second, miR-137 has been shown to affect proliferation and 
differentiation. 14,25,26,59 We found that cell density, morphology 
and neuronal fate markers were not altered by miR-137 at 24 or 
48 h. Given that cell cycle genes showed differential expression 
(Table 1), it is possible that longer periods of observation would 
have detected alterations in morphology or cell fate. 

Third, our results were consistent with the coordinated 
influence of miR-137 on multiple sets of functionally intercon- 
nected genes (for example, via the significant enrichment of 
protein-protein interactions in the differentially expressed genes 
and the pathway analyses). These results support the hypothesis 
that miR-137 causes subtle differential regulation of multiple 
genes in interrelated pathways, leading to larger physiological 
changes to a cell or organism. 

Fourth, as summarized above, highly significant schizophrenia 
GWAS findings near MIR137 and enrichment of its predicted 
targets support a role of MIR137 in schizophrenia. However, the 
region of association includes other candidate genes {DPYD and 
MIR2682). Critically, genes influenced by miR-137 in a human 
neural stem cell line (particularly downregulated genes) were 
enriched for smaller GWAS P-values (Table 3). Moreover, pathway 
analyses provide supporting data. Genes upregulated at 48 h 
(presumptive indirect targets) were consistently enriched for 
genes and pathways related to schizophrenia (Tables 1 and 2). 
These included significant overlap with major histocompatibility 
complex class II genes (encompassed by the strongest schizo- 
phrenia finding to date), genes with TCF4 (a schizophrenia risk 
locus and known target of miR-137) promoter-binding sites and 
pathways previously implicated in schizophrenia (synaptic genes, 
genes whose mRNAs interact with FMRP and L-type voltage-gated 
calcium channels). 

These results should be interpreted in the context of several 
limitations. First, we are cautious in our comparison of results 
between 24 and 48 h, given the change to a different method of 
RNA extraction. Second, we attempted to sequester miR-137 as an 
inhibition control but were unable to obtain technically satisfac- 
tory results. Third, the degree of overexpression of miR-137 was 
continual and far greater the naturally occurring levels of 
expression. 

In summary, our results have identified genes and pathways 
regulated through miR-137, supporting a role of miR-137 in fine- 
tuning neuronally important pathways including cell cycle 
regulation, L-type voltage-gated calcium channels, cell-cell 
interactions and synaptic function. These results provide empirical 
data supporting the hypothesis that direct and indirect miR-137 
targets have roles in the etiology of schizophrenia. 
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